CLG Authorship Analytics: a library for authorship verification
نویسندگان
چکیده
The task of authorship verification consists in detecting whether two texts have been written by the same person. This paper describes CLG Authorship Analytics software, which implements several individual methods as well a stacked generalization system for verification. approach relies primarily on ensemble learning methods, i.e. repeatedly sampling data order to capture invariant stylistic patterns. is tested through series experiments designed test ability generalize, depending various parameters. code and results are publicly available https://github.com/erwanm/clg-authorship-experiments .
منابع مشابه
Distractorless Authorship Verification
Authorship verification is the task of, given a document and a candidate author, determining whether or not the document was written by the candidate author. Traditional approaches to authorship verification have revolved around a “candidate author vs. everything else” approach. Thus, perhaps the most important aspect of performing authorship verification on a document is the development of an ...
متن کاملText Categorization for Authorship Verification
Abstract. One common version of the authorship attribution problem is that of authorship verification. We need to determine whether a given author, for whom we have a corpus of writing samples, is also the author of a given anonymous text. The set of alternate candidates is not limited to a given finite closed set. In this paper we show how usual text categorization methods can be adapted to so...
متن کاملA Profile-Based Method for Authorship Verification
Authorship verification is one of the most challenging tasks in stylebased text categorization. Given a set of documents, all by the same author, and another document of unknown authorship the question is whether or not the latter is also by that author. Recently, in the framework of the PAN-2013 evaluation lab, a competition in authorship verification was organized and the vast majority of sub...
متن کاملLinguistic Profiling for Authorship Recognition and Verification
A new technique is introduced, linguistic profiling, in which large numbers of counts of linguistic features are used as a text profile, which can then be compared to average profiles for groups of texts. The technique proves to be quite effective for authorship verification and recognition. The best parameter settings yield a False Accept Rate of 8.1% at a False Reject Rate equal to zero for t...
متن کاملAn Improved Impostors Method for Authorship Verification
Authorship verification has gained a lot of attention during the last years mainly due to the focus of PAN@CLEF shared tasks. A verification method called Impostors, based on a set of external (impostor) documents and a random subspace ensemble, is one of the most successful approaches. Variations of this method gained top-performing positions in recent PAN evaluation campaigns. In this paper, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Digital Humanities
سال: 2022
ISSN: ['2524-7832', '2524-7840']
DOI: https://doi.org/10.1007/s42803-022-00051-w